Variability of Contact Process in Complex Networks 
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We study numerically how the structures of distinct networks influence the epidemic dynamics in contact 
process. We first find that the variability difference between homogeneous and heterogeneous networks is very 
narrow, although the heterogeneous structures can induce the lighter prevalence. Contrary to non-community 
networks, strong community structures can cause the secondary outbreak of prevalence and two peaks of vari- 
ability appeared. Especially in the local community, the extraordinarily large variability in early stage of the 
outbreak makes the prediction of epidemic spreading hard. Importantly, the bridgeness plays a significant role 
in the predictability, meaning the further distance of the initial seed to the bridgeness, the less accurate the pre- 
dictability is. Also, we investigate the effect of different disease reaction mechanisms on variability, and find 
that the different reaction mechanisms will result in the distinct variabilities at the end of epidemic spreading. 

PACS numbers: 05.40.Fb,()5.60Cd,89.75.Hc 



The variability of outbreaks is defined as the relative 
variation of the prevalence. In order to assess the accu- 
racy and the forecasting capabilities of numerical models, 
the variability of outbreaks has been investigated in many 
studies. In numerical models, many factors such as net- 
work structures, travel flows, and initial conditions can af- 
fect the reliability of the epidemic spreading forecast. Re- 
cently, a contact process model with identical infectivity 
is proposed to study both dynamical processes and phase 
transitions of epidemic spreading in complex networks, 
but the predictability of the model is totally overlooked. 
In this paper, by investigating the variabilities of contact 
process in distinct networks, we show numerically that the 
bridgeness plays a significant role on the predictability of 
the epidemic pattern in community network, meaning the 
further distance of the initial seed to the bridgeness, the 
less accurate the predictability is. Hopefully, this work will 
provide us further understanding and new perspective in 
the variability of contact process in complex networks. 



I. INTRODUCTION 

The great threat of epidemic spreading to human society 
has been strongly catching scientists' eyes 0I2]- In order 
to realize the impact of diseases and develop effective strate- 
gies for their control and containment, the accurate mathemat- 
ical models of epidemic spreading are the basic conceptual 
tools BlrtH]. In mathematical models, the dynamical patterns 
of epidemic spreading will be influenced by many different 
factors such as the age and social structure of the popula- 
tion, the contact network among individuals, and the meta- 
population characteristics [5]. Especially the heterogeneity of 
the population network ||6J] can result in the absence of en- 
demic threshold when the population size is infinite and the 
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exponent of degree distribution 7 < 310-Qj]]. With the further 
study, the local structures of complex networks (such as de- 
gree correlation, clustering coefficient, community structure 
and so on) bring quantitative influences on epidemic spread- 
ing |[l2T - [l4[l . Considering the complicated local structures in 
real networks, the forecasting capabilities (i.e. variability) 
of current numerical models have been investigated |15|. In 
addition, both the stochastic nature of travel flows 1 16 , [TtI] 
and initial conditions can affect the reliability of the epidemic 
spreading forecast |[T8l - l24ll . 

In view of this point, Colizza et al. have studied the ef- 
fect of the airline transportation network on the predictability 
of the epidemic pattern by means of the normalized entropy 
function Hill , and found that the heterogeneous distribution 
of this network contributes to enhancing the predictability. In 
complex networks, many factors can decrease the forecasting 
accuracy of epidemic spreading. Crepey et al. have found 
that initial conditions such as the degree heterogeneity of the 
seed show a large variability on the prediction of the epi- 
demic prevalence, and the infection time of nodes have non- 
negligible fluctuations caused by the further distance and the 
multiplicity of paths to the seed Comparing the scale- 
free network (SFN) with community structure I125t - l30ll with 
the random SFN, the predictability of the prevalence can be 
found to be better 113111 . 

The common assumption in all the aforementioned works is 
that each node's potential infection-activity (infectivity), mea- 
sured by its possibly maximal contribution to the propagation 
process within one time step, is strictly equal to its degree. 
However, there are still many real spreading processes which 
can not be described well by this assumption ll32ll . There- 
fore, a contact process (CP) model with identical infectivity 
is proposed to study the epidemic spreading in complex net- 
works [33]. Almost all studies in CP are focused on dynamical 
processes and phase transitions [32-42], but the predictability 
of the model is totally overlooked. To this end, we study how 
the structures of distinct networks (i.e. homogeneous, het- 
erogeneous and community networks) influence the variabil- 
ities of epidemic patterns in CP. Through numerical experi- 
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ments, we find that the community structures can remarkably 
influence the prevalence and its variability, contrary to non- 
community networks (i.e. homogeneous and heterogeneous 
networks). It is worth noting that it's hard for the extraor- 
dinarily large variability in a local community to predict the 
epidemic prevalence. 

This paper is organized as follows. In Sec. II, we briefly de- 
scribe disease models in CP in complex networks and provide 
quantitative measurements of the predictability of epidemic 
spreading. In Sec. Ill, we investigate the prevalence variabil- 
ities in both random graph (RG) flU] and SFN Q. In Sec. 
IV, we discuss the essential differences of the prevalence vari- 
abilities both in the global network and the local community. 
Finally, we draw conclusions in Sec. V. 



n. CP MODEL IN COMPLEX NETWORKS 

In our model, three distinct networks, i.e. the homoge- 
neous, heterogeneous and community networks are adopted 
to investigate the predictability of epidemic spreading therein. 
Firstly, as the mother of all network models, the random graph 
of Erdos and Renyi B43I1 is regularly used in the study of com- 
plex networks because networks with a complex topology and 
unknown organizing principles often appear randomly ll45ll . 
Random graph is defined as a graph with TV nodes and con- 
nection probability p, which has a Poisson distribution. Sec- 
ondly, since scale-free property is observed in many real com- 
plex systems, dynamics study on scale-free networks have 
been holding everyone's concern. In 1999, Barabasi and Al- 
bert (BA) put forward the most classical SFN model which 
is rooted in two generic mechanisms: growth and preferen- 
tial attachment [44]. As there are community structures in 
social networks, the last studied structure substrate is com- 
munity network |46]. Here we will adopt a simplified com- 
munity model proposed by Liu and Hu, which emphasizes on 
the community feature in social networks [25]. For simplicity, 
two independent random graphs are first produced, and then 
two RGs are connected randomly by only one link. 

In general, the standard disease models conclude 
susceptible-infected (SI), susceptible-infected-susceptible 
(SIS), and susceptible-infected-refractory (SIR) epidemiolog- 
ical model. Each node of the network represents an individual 
and each link plays as one connection which transmits disease 
to other node. In SI (SIS or SIR) model, 'S', T and 'R' 
represents respectively the susceptible (healthy), the infected, 
and the refractory (recovered) state. At each time step of 
contact process, each infected node randomly contacts one of 
its neighbors, and then the contacted neighboring node will be 
infected with probability A if it is in the healthy state, or else 
its state will stay the same. At the same time, each infected 
nodes is cured and becomes susceptible (refractory) with rate 
fi in SIS (SIR) model. To eliminate the stochastic effect of 
the disease transmission, we can set A = 1 and \i = 0.2. 

In order to analyze the effect of the underlying network 
topology on the predictability of epidemic spreading, the vari- 
ability of outbreaks is defined as the relative variation of the 
prevalence [density of infected individuals i(t)] given by lfl9ll 



= ^-jm — • (1) 

A[z(t)] = denotes all independent dynamics realizations 
are essentially the same, and the prevalence in the network is 
deterministic. Larger A[i(i)] means worse predictability that 
a particular realization is far from average over independent 
realizations. 



III. PREDICTABILITY IN HOMOGENEOUS AND 
HETEROGENEOUS NETWORKS 

The first issue of our study is how the heterogeneity of net- 
work structures influences the variability of the prevalence in 
CP. By using a numerical approach in this section, we analyze 
the variabilities of outbreaks generated by different sets of ini- 
tial nodes, both for random graphs and scale-free networks 
with the same network size and average degree. Considering 
the fact that the results of a particular network can be gener- 
alized to any instances of network model 11911 . the numerical 
simulations we studied here are run in one network. In Fig. 
1, we show the curves i(t) and A[i(t)] computed for the dif- 
ferent disease models in both RG and SFN. For SI model in 
Fig. 1 (a), the density of infected i(t) in RG reachs its station- 
ary state faster than that in SFN; for SIS model in Fig. 1 (b), 
the stationary i(t) in RG is greater than that in SFN; and for 
SIR model in Fig. 1 (c), RG has the higher peak prevalence. 
Contrary to the results for the case of contacting all neighbors, 
it is first discovered that the heterogeneous structure can slow 
down the prevalence of outbreaks in CP. Because hubs may 
be contacted many times by their neighboring nodes at each 
time step, the total contact ability (i.e. the actual number of 
contacting nodes at one time step) of SFN is reduced further 
accordingly, as a result, the hub effect holds back the preva- 
lence of diseases. Meaningfully, owing to the limited contact 
ability of CP, the infected densities starting from the initial 
infected nodes (seeds) with different degrees are almost the 
same in SFN, which is distinct from the results for the case of 
contacting all neighbors lfl9ll . 

As shown in Fig. Q] (d), (e), and (f), there is slightly dif- 
ferent between variabilities in RG and SFN when t < 20, 
which implies heterogeneous structure does not visibly alter 
the predictability of CP before the outbreak of disease. An im- 
portant contribution of this study is to analyze the differences 
among the variabilities of three kinds of disease models. From 
the comparison among them, we find that different recovery 
mechanisms can result in distinct variabilities at the end of 
epidemic spreading. For SI model in Fig. 1 (d), the time ar- 
riving at i(t) = 1 varies in a mass of realizations, which can 
induce an exponential decay of the variability when t > 30. 
For SIS model in Fig. 1 (e), the variability of the prevalence 
will keep on a steady value in stationary state. For SIR model 
in Fig. 1 (f), due to the different lifetimes of the epidemics in a 
mass of independent realizations lfl8tl . the greater and greater 
variabilities are observed by approaching the end of the epi- 
demics. 
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FIG. 1: (color online). Evolution of both i(t) and A[i(f)] for the 
different disease models where the "triangles" and "circles" denote 
the cases of SF and RG networks with the random initial seeds. i(t) 
versus t for SI model (a), SIS model (b), and SIR model (c), and 
A[i(t)] versus t for SI model (d), SIS model (e), and SIR model (f). 
The parameters are chosen as N — 0.5 x 10 4 , (k) = 10, A = 1, 
and fi = 0.2. The results are averaged over 2 x 10 4 independent 
realizations in one network. 
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FIG. 2: (color online). Evolution of both i(t) and A[i(t)] in com- 
munity networks where the "squares", "circles", "triangleups", "tri- 
angledowns", and "trianglelefts" denote the cases of the bridgeness, 
d — 1,2, 3, and 4, respectively. i(t) versus t for SI model (a), SIS 
model (b), and SIR model (c), and A[i(t)] versus t for SI model (d), 
SIS model (e), and SIR model (f). The parameters are chosen as 
N = 10 4 , (k) — 10, A = 1, and ^ — 0.2. The results are averaged 
over 2 x 10 4 independent realizations. 



IV. PREDICTABILITY IN THE GLOBAL NETWORK AND 
THE LOCAL COMMUNITY 

A. Global network 

As many social networks combined by several communi- 
ties, such as Facebook 14711 . YouTube 114 811 . and Xiaonei 14911 . 



information propagation taking place in community net- 
works I25I - I29II is one of the most important subjects studying 
in complex networks, but in CP, the related research has been 
ignored for a long time. Therefore, in this section, we study 
the variability of CP in a very simple community network, 
where two RGs are connected randomly by only one link. 
Obviously, this network has a strong strength of community 
structure. In order to normalize the terms of community net- 
work, we define the link as weak tie [50], and call two nodes 
connected by this link "bridgeness" [51]. We first investigate 
the time evolution of epidemics generated by different seeds 
staying away from the bridgeness, and it is noted that there 
is only one initial seed in each realization. From Fig. 2 (a), 
(b), and (c), we can get that the closer the seed to the bridge- 
ness,the epidemic spreads much faster in the global network, 
and among all cases, the epidemic starting at the bridgeness 
spreads fastest. For SI model, the further distance to the brid- 
geness such as d = 3, 4 induces two periods of the quickly 
rising trend at the beginning time t = 10, 30, respectively. If 
the initial seed is far away from the bridgeness, the disease 
will be restricted in the first community for a long time till 
the bridgeness infected, in which almost all nodes is infected. 
As a result, the outbreak in the second community just starts 
at that moment the prevalence in the first community get to- 
wards the end, which causes the second outbreak. In the case 
of SIS and SIR model, the recovery mechanism reduces this 
phenomenon occurred, for instance there is the tiny second 
peak of the prevalence for d = 3, 4 in Fig. 2 (c). From the 
above, it is found that the bridgeness plays a distinctly impor- 
tant role in the rapid transmission of information in CP. 

The variability of prevalence in community network is dis- 
tinct from that in the network which has no community struc- 
ture. As shown in Fig. [2jd), (e), and (f), the curves A[i(t)] 
display two peaks because of the time delay between two out- 
breaks occurred in different communities. In addition, the fur- 
ther distance to the bridgeness makes the second peak occur 
much later. In Fig. EJd), for SI model, the first peak corre- 
sponds to the prevalence in the community with the initial 
seed, so the variability is almost the same as that in RG before 
t = 10. With the outbreak in the second community, the sec- 
ond peak occurs. Owing to the greater randomness of the time 
that disease first occurs in the second community (see Fig. |5J, 
the second peak of the variabilities is slightly greater than the 
first peak. After the infection density is close to saturated at 
1 « 40 (see Fig. |2ja)), the variability will be on exponen- 
tial decay. As all nodes of community network are infected in 
more and more realizations, the variability A[i(t)} will rapidly 
decay to zero. In contrast with the case of SI model, the sec- 
ond peak in SIS model is less than the first peak, because the 
recovery mechanism slows down the propagation velocity of 
diseases, which reduces the variability of the prevalence. That 
is to say, the recovery mechanism reduces the variability of 
epidemic spreading. Another extremely obvious difference 
is the variability decreases to a steady value at the stationary 
state. For SIR model in Fig. Eff), as time goes by, the epi- 
demics has the greater and greater variability, which is caused 
by the different lifetimes of the epidemics in 2 x 10 4 indepen- 
dent realizations. 
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FIG. 3: (color online). The distribution of arrival time of disease 
in the second community for SI model (a), SIS model (b), and 
SIR model (c), where the "squares", "circles", "triangleups", "tri- 
angledowns", and "trianglelefts" denote the cases of the bridgeness, 
d = 1,2,3, and 4, respectively. The results are averaged over 2 x 10 4 
independent realizations. 



B. The local community 

Considering the relative independence of a local commu- 
nity, we should take the prevalence and variability into ac- 
count. On the other hand, since disease must be transmitted 
through bridgenesses from the first community to the second 
community, this study contributes to understand the effect of 
them on epidemic spreading 15211 ■ In this section, we will 
specifically analyze the effect of different distances of seeds 
(to the bridgeness in the first community) on epidemic spread- 
ing in the second community. 

At first, the arrival time of disease is defined as the moment 
that infectious individual first occurs in the second community 
in each realization, thus the distribution of arrival time is ob- 
tained through massive realizations. In Fig. 3, the distribution 
of arrival time for the different initial seeds is showen. For SI 
model, the arrival distribution of the bridgeness as seed d = 
strictly obey the distribution P(t) = A(l - A)* -1 . When dis- 
ease seed is the node with one step to bridgeness, the arrival 
time increases generally, and the distribution becomes much 
wider and flatter. With the further increasing of distance of 
seed to the bridgeness (such as d = 3, 4), the distributions are 
nearly the same. It is understood that due to the finite size 
effect of network, the disease is transmitted through weak tie 
to the second community till overall outbreak happened in the 
first community. For SIS model, as a result of the recovery 
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FIG. 4: (color online). Evolution of both i(t) and A[i(£)] in the sec- 
ond community where the "squares", "circles", "triangleups", "tri- 
angledowns", and "trianglelefts" denote the cases of the bridgeness, 
d = 1,2, 3, and 4, respectively. i(t) versus t for SI model (a), SIS 
model (b), and SIR model (c), and A[i(t)] versus t for SI model 
(d), SIS model (e), and SIR model (f). The results are averaged over 
2 x 10 4 independent realizations. 



mechanism, the distributions of arrival time are much more 
evenly and smoothly than that for SI model, given the vari- 
ous initial seeds. Compared with Fig. [3] (a), we can find that 
there are two peaks for SIS and SIR model with d = 1. This 
is because the bridgeness may be infected through two basic 
pathways: the bridgeness may be infected directly by the ini- 
tial seed (i.e. its neighboring node) in t < the other 
route is the transmission of infection from the other neighbor- 
ing nodes when the disease outbreak in the first community, 
thus the second peak occurs at t m 20 (see Fig. |2jb)). For 
SIR model, owing to the recovery mechanism, the prevalence 
might well disappear in the first community before arriving at 
the weak tie. Consequently, arrival rate (i.e. the area of the 
distribution) is less than one, and the peak value of the cor- 
responding distribution is less than that for both SI and SIS 
model. 

Fig. |4] shows the prevalence and variability in the sec- 
ond community which are generated from the different initial 
seeds in the first community. From Fig.|4](a), (b), and (c),we 
can easily know that the prevalence of the bridgeness acting as 
seed increases much faster than that in the other cases gener- 
ated by the further seeds. In particular, for SIR model in Fig. [4] 
(c), the peak of the prevalence (for the case of the bridgeness) 
occurs first, and has the maximum value. In addition, there 
are two peaks of the prevalence for the case of d = 1, which 
is attributed to the propagation delay between two commu- 
nities. Since there is only one interconnection between two 
communities, the chance of infecting one from the other is 
low. In other words, infection within intra-community will be 
much faster than inter-community infection. Therefore, the 
first peak reflects the outbreak and extinction of disease inside 
with infectious seed, and the second peak emerges after the 
other community is infected. Note that the two peaks can be 
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only observed in SIR model because of the fact there will be 
no peak if virus does not "die". With the increase of distances 
d, the peak value is greater than that for d = 1, although the 
outbreaks occur later. 

In Fig.|4](d), (e), and (f), it is a surprise that the variabilities 
in the second community are distinct from that in the global 
network. Firstly, the variabilities in the second community 
are very large and also much greater than that in the global 
network in Fig. [2] which implies the huge unpredictability of 
prevalence produced in the local community. In particular, 
at the beginning of outbreaks, the variability for the case of 
d = 4 reaches about 50, which is 100 times the maximum 
value 0.5 in the global network. Secondly, the closer distance 
of the seed to bridgeness, the lower level of variability it has in 
the local community. In particular, the maximum variability 
value for the case of the bridgeness is only about 1, which is 
much less than 50 for the case of d = 4. Thus, the bridgeness 
plays a significant role in enhancing the predictability, that 
the closer initial seed to the bridgeness, the more accurate the 
predictability is. Thirdly, each curve of variabilities can be 
divided into four parts: the sudden drop stage, the relatively 
stable stage, the slowing-down stage, and the final stage of 
outbreaks (i.e. the exponential decay stage for SI model, the 
steady state stage for SIS model, and the sharp increase stage 
for SIR model, respectively). The first stage is originated from 
the uncertain arrival time of disease, with the increase of the 
arrival rate, the variability decreases. The second stage (10 < 
t < 20 in most cases) is induced by the interplay between the 
outbreak and the arrival of diseases in massive realizations: 
on the one hand, the outbreaks in some realizations upgrade 
the variability; on the other hand, the increase of the arrival 
rate counteracts this effect. As the infection density is close 
to saturated, the variability will enter the slowing-down stage. 
What is noteworthy is that the minimum variability value just 
corresponds to the peak value of prevalence for SIR model. In 
the end of epidemic spreading, the variabilities A[i(t)] display 
the distinct phenomena for the different disease models, for 
instance, SIR model shows the higher and higher variabilities. 

V. CONCLUSIONS AND DISCUSSIONS 

In conclusions, we have studied the variability of CP in 
complex networks, and get the clear understanding that the 



different network structures can remarkably influence the 
prevalence and its variability. Firstly, we find that the vari- 
ability difference between homogeneous and heterogeneous 
networks is very narrow, although the heterogeneous structure 
induces a lighter prevalence. Secondly, two peaks of both the 
prevalence and variability are shown in the community net- 
work. It's noted that in the local community, the extraordinar- 
ily large variability in early stage of the outbreak makes the 
prediction of disease spreading hard. This result is in accor- 
dance with Ref. [53] in which the networks with strong com- 
munity structures are of weak synchronizability, and the am- 
plitudes of the time series in the local communities are much 
larger than that in the global networks. Fortunately, the brid- 
geness plays a significant role in enhancing the predictability, 
the closer initial seed to the bridgeness, the more accurate the 
predictability is. This result suggests that bridgenesses may 
be the ideal detection stations in community networks. More- 
over, the different reaction mechanisms of disease models can 
result in the distinct variabilities. Especially for the case of 
SIR model, the greater and greater variabilities are observed 
at the end of the epidemics for the different lifetimes of the 
epidemics in various realizations. 

The community network employed in this study is much 
more simple, but the actual community networks have com- 
plex structures, such as multifarious communities, many brid- 
genesses, and the heterogeneous degree distribution in a local 
community. Therefore, the further investigation should be fo- 
cused on the more complex community networks. 
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